LOCUS pDONR™/Zeo DISCLAIMER Certain terms are trademarks or registered trademarks of Invitrogen Corporation. See "Intellectual Property" in the Help file for more information. FEATURES Location/Qualifiers misc_feature complement(268..295) /note="rrnB T2 transcription termination sequence (c)" misc_feature complement(427..470) /note="rrnB T1 transcription termination sequence (c)" primer_bind 537..552 /note="M13 Forward (-20) priming site" misc_recomb 570..668 /label=attL1 /note="attL1" misc_recomb complement(3368..3464) /label=attL2 /note="attL2 (c)" misc_signal complement(3479..3498) /note="T7 Promoter/priming site (c)" primer_bind 3506..3522 /note="M13 Reverse priming site" gene 3635..4444 /note="Kanamycin resistance gene" rep_origin 4565..5238 /note="pUC origin" vector join(3376..5241,1..651) /source="pDONR%99221" /type="Donor Vector" misc_feature 669..691 /note="TEV site" source 692..3361 /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:164948 IMAGE:40148207" /tissue_type="Donated clones,Novartis FGA collection" /clone_lib="NIH_MGC_417" /lab_host="DH5a" /note="Vector: pCMV-SPORT6" gene 692..3361 /gene="GALNT5" /gene_synonym=GALNAC-T5 /db_xref="GeneID:11227" /db_xref="HGNC:4127" CDS 692..3361 /dnas_title="UDP-N-acetyl-alpha-D-galactosamine:polypeptid e N-acetylgalactosaminyltransferase 5 (GalNAc-T5)" /gene="GALNT5" /gene_synonym=GALNAC-T5 /codon_start=1 /product="UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 5 (GalNAc-T5)" /protein_id="AAI42677.1" /db_xref="GI:148745655" /db_xref="GeneID:11227" /db_xref="HGNC:4127" /translation="MNRIRKFFRGSGRVLAFIFVASVIWLLFDMAALRLSFSEINTRV IKEDIVRRERIGFRVQPDQGKIFYSSIKEMKPPLRGHGKGAWGKENVRKTEESVLKVE VDLDQTQRERKMQNALGRGKVVPLWHPAHLQTLPVTPNKQKTDGRGTKPEASSHQGTP KQTTAQGAPKTSFIAAKGTQVVKISVHMGRVSLKQEPRKSHSPSSDTSKLAAERDLNV TISLSTDRPKQRSQAVANERAHPASTAVPKSGEAMALNKTKTQSKEVNANKHKANTSL PFPKFTVNSNRLRKQSINETPLGSLSKDDGARGAHGKKLNFSESHLVIITKEEEQKAD PKEVSNSKTKTIFPKVLGKSQSKHISRNRSEMSSSSLAPHRVPLSQTNHALTGGLEPA KINITAKAPSTEYNQSHIKALLPEDSGTHQVLRIDVTLSPRDPKAPGQFGRPVVVPHG KEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEV WSTLLRSVHSVINRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFPKVRILRLKERH GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVACPVIEVINDKD MSYMTVDNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDTIRCPVMAGGLFSIDKSYF FELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKT VERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQRELRKKLKCKSFKWYLEN VFPDLRAPIVRASGVLINVALGKCISIENTTVILEDCDGSKELQQFNYTWLRLIKCGE WCIAPIPDKGAVRLHPCDNRNKGLKWLHKSTSVFHPELVNHIVFENNQQLLCLEGNFS QKILKVAACDPVKPYQKWKFEKYYEA" misc_feature 692..3361 /note="GALNT5 coding region" misc_difference 2830..2830 /gene="GALNT5" /gene_synonym=GALNAC-T5 /note="'T' in cDNA is 'C' in the human genome; no amino acid change. The chimpanzee genome agrees with the cDNA sequence, suggesting that this difference is unlikely to be due to an artifact." ORIGIN 1 CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA 61 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA 121 GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA 181 CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATA CGCGTACCGC 241 TAGCCAGGAA GAGTTTGTAG AAACGCAAAA AGGCCATCCG TCAGGATGGC CTTCTGCTTA 301 GTTTGATGCC TGGCAGTTTA TGGCGGGCGT CCTGCCCGCC ACCCTCCGGG CCGTTGCTTC 361 ACAACGTTCA AATCCGCTCC CGGCGGATTT GTCCTACTCA GGAGAGCGTT CACCGACAAA 421 CAACAGATAA AACGAAAGGC CCAGTCTTCC GACTGAGCCT TTCGTTTTAT TTGATGCCTG 481 GCAGTTCCCT ACTCTCGCGT TAACGCTAGC ATGGATGTTT TCCCAGTCAC GACGTTGTAA 541 AACGACGGCC AGTCTTAAGC TCGGGCCCCA AATAATGATT TTATTTTGAC TGATAGTGAC 601 CTGTTCGTTG CAACACATTG ATGAGCAATG CTTTTTTATA ATGCCAACTT TGTACAAAAA 661 AGCAGGCTct gaaaacttgt actttcaagg caggagggag cggataggat tcagagttca 721 gccagaccaa ggaaaaattt tttacagcag cataaaagag atgaaacctc ccctaagggg 781 acatgggaaa ggggcatggg gcaaagagaa tgttagaaaa actgaggaga gtgtgctcaa 841 ggttgaggtg gacttggacc aaacccagag ggaaagaaaa atgcagaatg ccctgggaag 901 gggcaaggtt gtgccgttgt ggcatcctgc acatctgcag accctccctg tgactcctaa 961 caagcagaag acagacggga gaggcaccaa acctgaagcc tcctctcacc aggggacacc 1021 aaagcaaacg acagctcagg gggctccaaa gacctcattc atagcagcaa aaggaactca 1081 ggtagtcaaa atatcagtac acatgggacg tgtcagttta aaacaggagc cccggaagag 1141 tcatagtccc agcagtgaca catcaaaact agcagctgaa agggacttga atgtgaccat 1201 cagtcttagt actgatagac caaagcagcg atcacaggca gtagcaaacg agagggcaca 1261 ccctgccagc acagcagtgc cgaagtctgg ggaagccatg gccttaaaca aaactaagac 1321 tcagagcaaa gaagtcaatg caaataaaca caaagccaat acgagtcttc cttttcctaa 1381 gttcactgtc aattcaaatc gcttaaggaa gcaatctatt aatgagacac ctttgggaag 1441 tttgtcaaag gatgatggag ctagaggggc tcatgggaag aaactcaatt tctctgaaag 1501 ccatcttgtg attataacca aagaggaaga gcaaaaggca gaccccaaag aggtctctaa 1561 ttctaaaacc aaaacaatat ttcctaaagt attgggtaaa agccaaagta aacacatttc 1621 caggaataga agtgagatgt cttcctcttc acttgctcca catagagtgc cactgtccca 1681 aactaaccat gctttaactg gagggctaga gccagcaaaa atcaacataa ctgccaaagc 1741 cccctctaca gaatacaacc agagtcatat aaaagccctt ttacctgaag acagtggaac 1801 gcaccaggtg ttaagaattg atgtgacact ttctccaagg gaccccaaag ctccagggca 1861 gtttgggcgt cctgtagttg tcccccatgg aaaggagaag gaggcagaaa gaagatggaa 1921 agaaggaaac ttcaatgtct accttagcga tttgatccca gtggatagag ccattgaaga 1981 caccagacct gctggatgtg cagagcagct agttcacaat aacctcccaa ccaccagtgt 2041 catcatgtgc tttgtggatg aagtgtggtc cactctcctg agatctgttc acagtgtcat 2101 caatcgctct cctccacacc tcatcaagga gattctgctg gtagatgact tcagcaccaa 2161 agactatcta aaagataatt tggataaata catgtcccag tttccaaaag ttcggattct 2221 tcgcctcaaa gagagacatg gcttaataag ggccaggctg gcaggagcac agaatgcaac 2281 aggtgatgtg ttgacatttt tagattctca tgtggaatgt aacgttggtt ggttggaacc 2341 tcttctggaa agagtttatt taagtagaaa gaaagtggcc tgtccagtaa tcgaagtcat 2401 caatgataag gatatgagtt acatgacagt ggataacttt caaagaggca tctttgtgtg 2461 gcccatgaac tttggttgga gaacaattcc tccagatgtc attgcaaaaa acagaattaa 2521 agaaactgat acaataaggt gccctgtcat ggctggtgga ttgttttcta ttgacaaaag 2581 ttactttttt gaacttggaa catacgaccc tggccttgat gtttggggtg gggaaaatat 2641 ggagctctca ttcaaggtgt ggatgtgtgg tggtgaaatt gagatcattc cctgctcccg 2701 agtgggccat atattcagaa atgacaatcc atattccttc cccaaagacc ggatgaagac 2761 agtggagcgg aacttggtgc gggttgccga ggtctggctg gatgagtata aggagctgtt 2821 ctatggccat ggagaccacc tcatcgacca agggctagat gttggcaacc tcacccagca 2881 aagggagctg cgaaagaaac tgaagtgcaa aagtttcaaa tggtacttgg agaatgtctt 2941 tcctgactta agggctccca ttgtgagagc tagtggtgtg cttattaatg tggctttggg 3001 taaatgcatt tccattgaaa acactacagt cattctggaa gactgcgatg ggagcaaaga 3061 gcttcaacaa tttaattaca cctggttaag acttattaaa tgtggagaat ggtgtatagc 3121 ccccatccct gataaaggag ccgtaaggct gcacccttgt gataacagaa acaaagggct 3181 aaaatggctg cataaatcaa catcagtctt tcatccagaa ctggtgaatc acattgtttt 3241 tgaaaacaat cagcaattat tatgcttgga aggaaatttt tctcaaaaga tcctgaaagt 3301 agctgcctgt gacccagtga agccatatca aaagtggaaa tttgaaaaat attatgaagc 3361 cTAGGACCCA GCTTTCTTGT ACAAAGTTGG CATTATAAGA AAGCATTGCT TATCAATTTG 3421 TTGCAACGAA CAGGTCACTA TCAGTCAAAA TAAAATCATT ATTTGCCATC CAGCTGATAT 3481 CCCCTATAGT GAGTCGTATT ACATGGTCAT AGCTGTTTCC TGGCAGCTCT GGCCCGTGTC 3541 TCAAAATCTC TGATGTTACA TTGCACAAGA TAAAATAATA TCATCATGAA CAATAAAACT 3601 GTCTGCTTAC ATAAACAGTA ATACAAGGGG TGTTATGAGC CATATTCAAC GGGAAACGTC 3661 GAGGCCGCGA TTAAATTCCA ACATGGATGC TGATTTATAT GGGTATAAAT GGGCTCGCGA 3721 TAATGTCGGG CAATCAGGTG CGACAATCTA TCGCTTGTAT GGGAAGCCCG ATGCGCCAGA 3781 GTTGTTTCTG AAACATGGCA AAGGTAGCGT TGCCAATGAT GTTACAGATG AGATGGTCAG 3841 ACTAAACTGG CTGACGGAAT TTATGCCTCT TCCGACCATC AAGCATTTTA TCCGTACTCC 3901 TGATGATGCA TGGTTACTCA CCACTGCGAT CCCCGGAAAA ACAGCATTCC AGGTATTAGA 3961 AGAATATCCT GATTCAGGTG AAAATATTGT TGATGCGCTG GCAGTGTTCC TGCGCCGGTT 4021 GCATTCGATT CCTGTTTGTA ATTGTCCTTT TAACAGCGAT CGCGTATTTC GTCTCGCTCA 4081 GGCGCAATCA CGAATGAATA ACGGTTTGGT TGATGCGAGT GATTTTGATG ACGAGCGTAA 4141 TGGCTGGCCT GTTGAACAAG TCTGGAAAGA AATGCATAAA CTTTTGCCAT TCTCACCGGA 4201 TTCAGTCGTC ACTCATGGTG ATTTCTCACT TGATAACCTT ATTTTTGACG AGGGGAAATT 4261 AATAGGTTGT ATTGATGTTG GACGAGTCGG AATCGCAGAC CGATACCAGG ATCTTGCCAT 4321 CCTATGGAAC TGCCTCGGTG AGTTTTCTCC TTCATTACAG AAACGGCTTT TTCAAAAATA 4381 TGGTATTGAT AATCCTGATA TGAATAAATT GCAGTTTCAT TTGATGCTCG ATGAGTTTTT 4441 CTAATCAGAA TTGGTTAATT GGTTGTAACA CTGGCAGAGC ATTACGCTGA CTTGACGGGA 4501 CGGCGCAAGC TCATGACCAA AATCCCTTAA CGTGAGTTAC GCGTCGTTCC ACTGAGCGTC 4561 AGACCCCGTA GAAAAGATCA AAGGATCTTC TTGAGATCCT TTTTTTCTGC GCGTAATCTG 4621 CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT TGTTTGCCGG ATCAAGAGCT 4681 ACCAACTCTT TTTCCGAAGG TAACTGGCTT CAGCAGAGCG CAGATACCAA ATACTGTTCT 4741 TCTAGTGTAG CCGTAGTTAG GCCACCACTT CAAGAACTCT GTAGCACCGC CTACATACCT 4801 CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC GATAAGTCGT GTCTTACCGG 4861 GTTGGACTCA AGACGATAGT TACCGGATAA GGCGCAGCGG TCGGGCTGAA CGGGGGGTTC 4921 GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA CTGAGATACC TACAGCGTGA 4981 GCTATGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG GACAGGTATC CGGTAAGCGG 5041 CAGGGTCGGA ACAGGAGAGC GCACGAGGGA GCTTCCAGGG GGAAACGCCT GGTATCTTTA 5101 TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG 5161 GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG 5221 CTGGCCTTTT GCTCACATGT T //